Generating Classification Rules According to User's Existing Knowledge
نویسندگان
چکیده
An important problem in applying classification rule induction techniques to practical applications is how to produce rules that are related to the user’s existing knowledge about the domain and his/her current interests. Such rules are interesting to the user, and also easily understood and trusted by the user. They can enhance the existing knowledge of the domain and be relied upon in real-world performance tasks. Past research and applications have shown this to be a crucial requirement in many real-life applications. Existing techniques for dealing with this problem typically use sophisticated methods to bias the rule induction process in order to produce rules that are consistent with the existing knowledge. In this paper, we propose a novel and simple approach. It only needs to pre-process the data using the user’s existing knowledge. It does not make any modification to the rule induction technique. Practical applications have shown that this simple approach is surprisingly effective and flexible. It demonstrates that to obtain useful results, we do not necessarily need to use sophisticated techniques. Sometimes simple approaches may just be sufficient.
منابع مشابه
Discovering Conforming and Unexpected Classification Rules
One problem in applying machine learning and knowledge discovery techniques to solve real-world problems is how to incorporate the user's concepts about the application domain into the learning process to discover interesting rules to the user. Rules are interesting if they are useful and/or provide new knowledge to the user. Interesting rules are subjective because they depend on the individua...
متن کاملGENERATING FUZZY RULES FOR PROTEIN CLASSIFICATION
This paper considers the generation of some interpretable fuzzy rules for assigning an amino acid sequence into the appropriate protein superfamily. Since the main objective of this classifier is the interpretability of rules, we have used the distribution of amino acids in the sequences of proteins as features. These features are the occurrence probabilities of six exchange groups in the seque...
متن کاملKnowledge discovery from patients’ behavior via clustering-classification algorithms based on weighted eRFM and CLV model: An empirical study in public health care services
The rapid growing of information technology (IT) motivates and makes competitive advantages in health care industry. Nowadays, many hospitals try to build a successful customer relationship management (CRM) to recognize target and potential patients, increase patient loyalty and satisfaction and finally maximize their profitability. Many hospitals have large data warehouses containing customer ...
متن کاملKnowledge discovery from patients’ behavior via clustering-classification algorithms based on weighted eRFM and CLV model: An empirical study in public health care services
The rapid growing of information technology (IT) motivates and makes competitive advantages in health care industry. Nowadays, many hospitals try to build a successful customer relationship management (CRM) to recognize target and potential patients, increase patient loyalty and satisfaction and finally maximize their profitability. Many hospitals have large data warehouses containing customer ...
متن کاملA Margin-based Model with a Fast Local Searchnewline for Rule Weighting and Reduction in Fuzzynewline Rule-based Classification Systems
Fuzzy Rule-Based Classification Systems (FRBCS) are highly investigated by researchers due to their noise-stability and interpretability. Unfortunately, generating a rule-base which is sufficiently both accurate and interpretable, is a hard process. Rule weighting is one of the approaches to improve the accuracy of a pre-generated rule-base without modifying the original rules. Most of the pro...
متن کامل